6 Concluding Remarks 5.2 Incorporate Database Categorization 5 Improving the Retrieval of Relevant Documents

Authors

  • C. Yu
  • K. Liu
  • W. Wu
  • W. Meng
Abstract

In this paper, we proposed a new, highly scalable method to solve the database selection problem when the number of local databases in a metasearch engine is very large. For single-term queries, this method guarantees that all desired documents are retrieved. Experiments were conducted to show that this method is very effective for typical short Internet queries. We also illustrated that our database selection and collection fusion method can be combined with other techniques to improve retrieval effectiveness. We plan to continue this research in the following directions. (1) Further improve the accuracy of our method. A study in [38] indicates that, by incorporating certain dependencies among adjacent query terms, the accuracy can be further improved. One research issue is how to incorporate such dependencies in the integrated database representative. (2) Extend our method, which works very well for short Internet queries, so that it also works well for longer queries.

… a given query q if and only if MDR(q, D_1) > MDR(q, D_2) > ... > MDR(q, D_N), where MDR(q, D_i) is the degree of relevance of the most relevant document in database D_i with respect to the query q. The proof of Proposition 3 and the description of a method for estimating the degree of relevance of the most relevant document in a database can be found in [41] and will not be repeated here. It was reported in [41] that the method yields retrieval effectiveness very close to what would be obtained if all documents were placed at one site (cor iden doc ranges from 88% to 98%).

As mentioned earlier, one reason Internet queries have low retrieval effectiveness is that they are usually very short, and short queries do not provide enough context words to determine the meanings of their terms.
One technique to address this problem is to first assign queries and databases to appropriate topics or concepts and then limit the search, for a given query, to only those databases that share the same concepts as those assigned to the query. The idea is to use these concepts as context to help disambiguate the meanings of terms in databases and queries and, as a result, to improve the retrieval of relevant documents. Several researchers have studied the idea of assigning databases to concepts and/or clustering documents into new databases to improve retrieval …
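
The two ideas in the abstract, restricting the search to databases that share a concept with the query and ordering databases by MDR(q, D_i), can be illustrated with a minimal sketch. This is not the authors' implementation: the `Database` class, the `rank_databases` function, the concept labels, and the MDR values are all hypothetical, and the MDR estimates are simply given as toy numbers rather than computed by the estimation method of [41].

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Database:
    """Hypothetical stand-in for a local database in a metasearch engine."""
    name: str
    concepts: set[str]
    # Assumed to be precomputed elsewhere: query text -> estimated MDR(q, D).
    mdr_estimates: dict[str, float] = field(default_factory=dict)

    def mdr(self, query: str) -> float:
        # Unknown queries default to 0.0 (no evidence of a relevant document).
        return self.mdr_estimates.get(query, 0.0)


def rank_databases(query: str, query_concepts: set[str],
                   databases: list[Database]) -> list[Database]:
    """Keep databases sharing a concept with the query, then sort by MDR."""
    candidates = [db for db in databases if db.concepts & query_concepts]
    # Descending MDR: the database holding the most relevant document first.
    return sorted(candidates, key=lambda db: db.mdr(query), reverse=True)


# Toy data: the ambiguous query "jaguar" is assigned the concept "animals".
databases = [
    Database("cars", {"automobiles"}, {"jaguar": 0.81}),
    Database("zoo", {"animals"}, {"jaguar": 0.93}),
    Database("wildlife", {"animals", "nature"}, {"jaguar": 0.67}),
]
ranked = rank_databases("jaguar", {"animals"}, databases)
print([db.name for db in ranked])  # ['zoo', 'wildlife']
```

In this toy run the "cars" database is excluded despite its high MDR value because it shares no concept with the query, which is the disambiguation effect the paragraph above describes.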




Publication date: 2007